Search CORE

31 research outputs found

Of Cores: A Partial-Exploration Framework for Markov Decision Processes

Author: Meggendorfer Tobias
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 30th International Conference on Concurrency Theory (CONCUR 2019)
Publication date: 01/01/2019
Field of study

We introduce a framework for approximate analysis of Markov decision processes (MDP) with bounded-, unbounded-, and infinite-horizon properties. The main idea is to identify a "core" of an MDP, i.e., a subsystem where we provably remain with high probability, and to avoid computation on the less relevant rest of the state space. Although we identify the core using simulations and statistical techniques, it allows for rigorous error bounds in the analysis. Consequently, we obtain efficient analysis algorithms based on partial exploration for various settings, including the challenging case of strongly connected systems

Dagstuhl Research Online Publication Server

Anytime Guarantees for Reachability in Uncountable Markov Decision Processes

Author: Grover Kush
Meggendorfer Tobias
Weininger Maximilian
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 33rd International Conference on Concurrency Theory (CONCUR 2022)
Publication date: 01/01/2022
Field of study

We consider the problem of approximating the reachability probabilities in Markov decision processes (MDP) with uncountable (continuous) state and action spaces. While there are algorithms that, for special classes of such MDP, provide a sequence of approximations converging to the true value in the limit, our aim is to obtain an algorithm with guarantees on the precision of the approximation. As this problem is undecidable in general, assumptions on the MDP are necessary. Our main contribution is to identify sufficient assumptions that are as weak as possible, thus approaching the "boundary" of which systems can be correctly and reliably analyzed. To this end, we also argue why each of our assumptions is necessary for algorithms based on processing finitely many observations. We present two solution variants. The first one provides converging lower bounds under weaker assumptions than typical ones from previous works concerned with guarantees. The second one then utilizes stronger assumptions to additionally provide converging upper bounds. Altogether, we obtain an anytime algorithm, i.e. yielding a sequence of approximants with known and iteratively improving precision, converging to the true value in the limit. Besides, due to the generality of our assumptions, our algorithms are very general templates, readily allowing for various heuristics from literature in contrast to, e.g., a specific discretization algorithm. Our theoretical contribution thus paves the way for future practical improvements without sacrificing correctness guarantees

Dagstuhl Research Online Publication Server

Index appearance record with preorders

Author: Kretinsky Jan
Meggendorfer Tobias
Waldmann Clara
Weininger Maximilian
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

Transforming ω-automata into parity automata is traditionally done using appearance records. We present an efficient variant of this idea, tailored to Rabin automata, and several optimizations applicable to all appearance records. We compare the methods experimentally and show that our method produces significantly smaller automata than previous approaches

IST Austria: PubRep (Institute of Science and Technology)

An Anytime Algorithm for Reachability on Uncountable MDP

Author: Grover Kush
Křetínský Jan
Meggendorfer Tobias
Weininger Maximilian
Publication venue
Publication date: 10/08/2020
Field of study

We provide an algorithm for reachability on Markov decision processes with uncountable state and action spaces, which, under mild assumptions, approximates the optimal value to any desired precision. It is the first such anytime algorithm, meaning that at any point in time it can return the current approximation with its precision. Moreover, it simultaneously is the first algorithm able to utilize \emph{learning} approaches without sacrificing guarantees and it further allows for combination with existing heuristics

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

IST Austria: PubRep (Institute of Science and Technology)

Entropic Risk for Turn-Based Stochastic Games

Author: Baier Christel
Chatterjee Krishnendu
Meggendorfer Tobias
Piribauer Jakob
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 48th International Symposium on Mathematical Foundations of Computer Science (MFCS 2023)
Publication date: 01/01/2023
Field of study

Entropic risk (ERisk) is an established risk measure in finance, quantifying risk by an exponential re-weighting of rewards. We study ERisk for the first time in the context of turn-based stochastic games with the total reward objective. This gives rise to an objective function that demands the control of systems in a risk-averse manner. We show that the resulting games are determined and, in particular, admit optimal memoryless deterministic strategies. This contrasts risk measures that previously have been considered in the special case of Markov decision processes and that require randomization and/or memory. We provide several results on the decidability and the computational complexity of the threshold problem, i.e. whether the optimal value of ERisk exceeds a given threshold. In the most general case, the problem is decidable subject to Shanuel’s conjecture. If all inputs are rational, the resulting threshold problem can be solved using algebraic numbers, leading to decidability via a polynomial-time reduction to the existential theory of the reals. Further restrictions on the encoding of the input allow the solution of the threshold problem in NP∩coNP. Finally, an approximation algorithm for the optimal value of ERisk is provided

Dagstuhl Research Online Publication Server

IST Austria: PubRep (Institute of Science and Technology)